首页> 外文OA文献 >Text Independent Automatic Speaker Recognition System Using Mel-Frequency Cepstrum Coefficients and Gaussian Mixture Models

【2h】

Text Independent Automatic Speaker Recognition System Using Mel-Frequency Cepstrum Coefficients and Gaussian Mixture Models

机译：基于Mel倒谱系数和高斯混合模型的文本独立自动说话人识别系统

代理获取

本网站仅为用户提供外文OA文献查询和代理获取服务，本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文，但由于OA文献来源多样且变更频繁，仍可能出现获取不到、文献不完整或与标题不符等情况，如果获取不到我们将提供退款服务。请知悉。

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

The aim of this paper is to show the accuracy and time results of a text independent automatic speaker recognition (ASR) system, based on Mel-Frequency Cepstrum Coefficients (MFCC) and Gaussian Mixture Models (GMM), in order to develop a security control access gate. 450 speakers were randomly extracted from the Voxforge.org audio database, their utterances have been improved using spectral subtraction, then MFCC were extracted and these coefficients were statistically analyzed by GMM in order to build each profile. For each speaker two different speech files were used: the first one to build the profile database, the second one to test the system performance. The accuracy achieved by the proposed approach is greater than 96% and the time spent for a single test run, implemented in Matlab language, is about 2 seconds on a common PC.

机译：本文的目的是展示基于梅尔倒谱倒谱系数（MFCC）和高斯混合模型（GMM）的文本独立自动说话人识别（ASR）系统的准确性和时间结果，以便开发安全控制检修门。从Voxforge.org音频数据库中随机提取了450个说话者，使用频谱相减法改善了他们的话语，然后提取MFCC，并通过GMM对这些系数进行统计分析以建立每个配置文件。对于每个发言人，使用了两个不同的语音文件：第一个用于建立配置文件数据库，第二个用于测试系统性能。所提出的方法所实现的准确性大于96％，并且在Matlab语言上实现的单次测试所花费的时间在普通PC上约为2秒。

著录项

作者
Alfredo Maesa; Fabio Garzia; Michele Scarpiniti; Roberto Cusani;
展开▼
作者单位

展开▼
年度 2012
总页数
原文格式 PDF
正文语种 eng
中图分类

相似文献

外文文献
中文文献
专利

1. Text Independent Automatic Speaker Recognition System Using Mel-Frequency Cepstrum Coefficient and Gaussian Mixture Models [J] . Alfredo Maesa, Fabio Garzia, Michele Scarpiniti, Journal of Information Security . 2012,第4期

机译：基于Mel倒谱系数和高斯混合模型的文本独立说话人自动识别系统
2. Text-independent speaker identification system based on the histogram of DCT-cepstrum coefficients [J] . S. Al-Rawahy, A. Hossen, U. Heute International Journal of Knowledge-Based in Intelligent Engineering Systems . 2012,第3期

机译：基于DCT倒谱系数直方图的文本无关说话人识别系统
3. Robust text-independent speaker identification using Gaussian mixture speaker models [J] . Reynolds D.A., Rose R.C. IEEE Transactions on Speech and Audio Proceeding . 1995,第1期

机译：使用高斯混合说话人模型进行鲁棒的与文本无关的说话人识别
4. Vector Quantization In Text Dependent Automatic Speaker Recognition Using Mel-frequency Cepstrum Coefficient [C] . AHSANUL KABIR, SHEIKH MOHAMMAD MASUDUL AHSAN WSEAS International Conferences . 2007

机译：矢量量化在文本依赖性自动扬声器识别中使用熔融频率综合扬声器识别
5. Mixtures of inverse covariances: Covariance modeling for Gaussian mixtures with applications to automatic speech recognition. [D] . Vanhoucke, Vincent. 2004

机译：逆协方差的混合：高斯混合的协方差建模及其在自动语音识别中的应用。
6. A Batch Rival Penalized Expectation-Maximization Algorithm for Gaussian Mixture Clustering with Automatic Model Selection [O] . Jiechang Wen, Dan Zhang, Yiu-ming Cheung, 2012

机译：具有自动模型选择的高斯混合聚类的批次竞争惩罚惩罚期望最大化算法
7. Text Independent Automatic Speaker Recognition System Using Mel-Frequency Cepstrum Coefficient and Gaussian Mixture Models [O] . Alfredo Maesa, Fabio Garzia, Michele Scarpiniti, 2012

机译：基于Mel倒谱系数和高斯混合模型的文本独立说话人自动识别系统

Text Independent Automatic Speaker Recognition System Using Mel-Frequency Cepstrum Coefficients and Gaussian Mixture Models

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅